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rsq ■ Asymptotic properties of the local Whittle estimator in the non- 

stationary case (d > i) are explored. For i < d < 1, the estimator is 
shown to be consistent, and its limit distribution and the rate of con- 
vergence depend on the value of d. For d= 1, the limit distribution 

t/J I is mixed normal. For d > 1 and when the process has a polynomial 

(~| ■ trend of order a > i, the estimator is shown to be inconsistent and 

+H ' to converge in probability to unity. 

1. Introduction. Semiparanietric estimation of the memory parameter 
(d) in fractionally integrated [1(d)] time series has attracted much recent 
^ ' study and is attractive in empirical applications because of its general treat- 

ed . ment of the short memory component. Two commonly used semiparametric 

estimators are log periodogram (LP) regression and local Whittle estimation. 
\^ ■ LP regression is popular mainly because of the simplicity of its construction 

Cp . as a linear regression estimator. Local Whittle estimation involves numerical 

methods but is more efficient than LP regression. The local Whittle estima- 
tor was proposed by Kiinsch (1987) and Robinson (1995) showed its con- 
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"^ . sistency and asymptotic normality for dG (— ^,^). Velasco (1999) extended 

2 ! Robinson's results to show that the estimator is consistent for d G ("2' ^) 

C ' and asymptotically normally distributed for d G (— 2' f)' 
K»" ■ The present paper studies the asymptotic properties of the local Whittle 

k>( I estimator in the nonstationary case for d> ^^ including the unit root case 

^ • and the case where the process has a polynomial time trend. These cases are 

^ of high importance in empirical work especially with economic time series, 
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which commonly exhibit nonstationary behavior and show some evidence 
of deterministic trends as well as long range dependence. The asymptotic 
properties of the local Whittle estimator in the nonstationary case over 
the region de (^,1) were explored in Velasco (1999). Velasco also showed 
that, upon adequate tapering of the observations, the region of consistent 
estimation of d may be extended but with corresponding increases in the 
variance of the limit distribution. For the region d>l, there is presently no 
theory for the untapered Whittle estimator and, for the region d S (|, 1), no 
limit distribution theory. The unit root case is of particular interest because 
it stands as an important special case of an I[d) process with d= 1 and it 
has played a central role in the study of nonstationary economic time series. 
It is also now known to be the borderline that separates cases of consistent 
and inconsistent estimation by LP regression [Kim and Phillips (1999)] and, 
as we shall show here, local Whittle estimation. 

This paper demonstrates that the local Whittle estimator (i) is consistent 
for d G (^,1], (ii) is asymptotically normally distributed for d E (|, |), (iii) 
has a non-normal limit distribution for d G [f ) l)j (iv) has a mixed normal 
limit distribution for d = l, (v) converges to unity in probability for d > 1 
and (vi) converges to unity in probability when the process has a polynomial 
time trend of order a> 2- The present paper, therefore, complements the 
earlier work of Robinson (1995) and Velasco (1999) and largely completes 
the study of the asymptotic properties of the local Whittle estimator for 
regions of d that are empirically relevant in most applications. The paper 
also serves as a counterpart to Phillips (1999b) and Kim and Phillips (1999), 
which analyze the asymptotics of LP regression for d£ (2)2). 

The approach in the present paper draws on an exact representation and 
approximation theory for the discrete Fourier transform (d.f.t.) of nonsta- 
tionary fractionally integrated processes. The theory, developed by Phillips 
(1999a), employs a model for nonstationary fractionally integrated processes 
that is valid for all values of d and provides a uniform apparatus for analyz- 
ing the asymptotic behavior of their d.f.t. 's. 

The remainder of the paper is organized as follows. Section 2 introduces 
the model. Consistency of the local Whittle estimator for d € {2 A] a-nd its 
inconsistency for d > 1 are demonstrated in Section 3. Section 4 derives 
the limit distributions. Results for fractionally integrated processes with 
a polynomial time trend are given in Section 5. Section 6 reports some 
simulation results and gives an empirical application using economic data. 
Section 7 makes some brief remarks on the important practical issue of 
finding a good general purpose estimator of d when nonstationarity in the 
data is suspected. Some technical results are collected in Appendix A. Proofs 
are given in Appendix B. 
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2. Preliminaries. We consider the fractional process Xt generated by the 
model 

(1) {\-LY{Xt-Xo)=ut, t = 0,l,2,..., 

where Xq is a random variable with a certain fixed distribution. Our interest 
is in the case where Xt is nonstationary and d> ^^ so in (1) we work from 
a given initial date i = 0, set Uf = for all t < and assume that ut, t> 1, 
is stationary with zero mean and spectral density /m(A). Expanding the 
binomial in (1) gives the form 

(2) i:^-^{xt^k-xo)=ut, 



fe=0 



where 



id)k = ^^ = {d){d+l)---{d + k-l) 



is Pochhammer's symbol for the forward factorial function and r(-) is the 
gamma function. When d is a positive integer, the series in (2) terminates, 
giving the usual formulae for the model (1) in terms of the differences and 
higher order differences of Xt- An alternate form for Xt is obtained by 
inversion of (1), giving a valid representation for all values of d, 

t- 

' k\ 



(3) Xt = {l- Lr% + Xo = Y. ^^t-k + Xo 



k=0 

Define the discrete Fourier transform and the periodogram of a time series 
at evaluated at the fundamental frequencies as 



Wa{Xs) = f- — y" ate'^^" , \s = , s = 1, . . . , n. 



n 



1 V^ Jt\s • 27rs 

(4) V2^ tr{ 

Ia{\s) = \Wa{\s)?- 

The model (1) is not the only model of nonstationary fractional integra- 
tion. Another model that is used in the literature forms a process Xt with 
d G [|) |) from the partial sum of a stationary long-range dependent process, 
as in 

t 

(5) Xt = Y.Uk+Xo, de[i,|), 

fc=i 

where Ut has spectral density /(A) ~ Gq\~'^^'^~^' as A ^ 0. Model (5) applies 
for the specific range of values d e [^, |) and this can be extended by repeated 
use of partial summation in the definition. Model (1) directly provides a valid 
model for all values of d. Some interest in (1) has already been shown in the 
literature [e.g., Marinucci and Robinson (2000) and Robinson and Marinucci 
(2001)]. 
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3. Local Whittle estimation: consistency for d < 1 and inconsistency for 

d > 1. Local Whittle (Gaussian semiparanietric) estimation was developed 
by Kiinsch (1987) and Robinson (1995). Specifically, it starts with the fol- 
lowing Gaussian objective function, defined in terms of the parameter d and 
G: 

-, m r \2d 

1 ^r-r, 2d, , ^j 



log(GA-^^) + -^I.(A, 



(6) Q^^G,d) = -J2 

where m is some integer less than n. The local Whittle procedure estimates 
G and d by minimizing Qm{G,d), so that 

{G,d)= argmin Qm{G,d), 

GG(0,oo),dG[Ai,A2] 

where Ai and A2 are numbers such that — 2 < Ai < A2 < 00. It will be 
convenient in what follows to distinguish the true values of the parameters 
by the notation Gq = /n(0) and d^. Concentrating (6) with respect to G as 
in Robinson (1995) gives 



d= argmin R{d), 

de[Ai,A2] 



where 



m 



R{d) = logG{d) - 2d— ^log Aj, 

We now introduce the assumptions on m and the stationary component 
ut in (1). 

Assumption 1. 

/4A)~A(0)G(0,oo) asA^O + . 

Assumption 2. In a neighborhood (0,5) of the origin, /u(A) is differ- 
entiable and 

^log/„(A) = 0(A-i) asA^O + . 

Assumption 3. 

00 00 

(7) Ut = C{L)et = ^ CjEt-^j, ^ c| < 00, 

j=o j=o 
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where £'(ej|Fj_i) = 0, S(e^|Ft_i) = 1 a.s., t = 0,±1,..., in which Ft is the 
(7-field generated by e^, s <t, and there exists a random variable e such that 
Ee"^ < oo and for ah r/ > and some K > 0, Pr(|ej| > rj) < i^Pr(|e| > r]). 



Assumption 4. 



1 m 

1 >0 asn^oo. 

m n 



Assumptions 1-3 are analogous to Assumptions A1-A3 of Robinson (1995). 
However, we impose them in terms of ut rather than Xf. Assumption 4 is 
the same as Assumption A4 of Robinson (1995). 

Lemma A. 1(a) in Appendix A gives the following expression for Wx{Xs): 

1-e* 






(8) 

e^^= Xn-Xo 1 C/A,n(e) 



l-e*^- V2to l-e»^- ^/2 



vrn 



Neglecting the third term of (8) as a remainder, Wx{Xs) is seen to comprise 
two terms — a function of the d.f.t. of ut and a function of X„. As the value 
of d changes, the stochastic magnitude of the two components changes, and 
this influences the asymptotic behavior of Wx{Xs)- When d < 1, the first 
term dominates the second term and Wx{Xs) behaves like X~'^Wu{X.s), being 
asymptotically uncorrelated for different frequencies. When d>l, the sec- 
ond term becomes dominant and Wx{Xs) behaves like Xj^{Xn — A'o)/\/27rn, 
being perfectly correlated across all A^. This switching behavior of Wx{Xs) 
at d = 1 is a key determinant of the asymptotic properties of the local Whit- 
tle estimator, as well as other procedures like LP regression. When d=l, 
the two terms have the same stochastic order and this leads to a form of 
asymptotic behavior that is particular to this case. 

Theorem 3.1 below establishes that d is consistent for do G (25!] ^^"^ 
hence consistency carries over to the unit root case. While G is consistent 
for do G (2)!)) however, it is inconsistent and tends to a random quantity 
when do = 1- 

Theorem 3.1. Suppose Xt is generated by (1) with do £ [Ai,A2] and 
Assumptions 1-4 hold. Then, for do £ (o' ^l' d^ d^ as n^ 00, and 



G{d) 



Go, for do £(1,1), 

Goil + xl), for do = I. 



When do > 1, d manifests very different behavior. It converges to unity 
in probability and the local Whittle estimator becomes inconsistent. So the 
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local Whittle estimator is biased downward even in very large samples when- 
ever the true value of d is greater than unity. Kim and Phillips (1999) showed 
that the LP regression estimator also converges to unity when dQ> 1. 

Theorem 3.2. Under the same conditions as Theorem 3.1, for do £ 
(1, M] with 1 < M <(X), d-^ 1 as n^oo. 

Remark 3.3. Velasco (1999) showed that d is consistent for do G (^, 1) 
using the model (5). We conjecture that our consistency and inconsistency 
results for the local Whittle estimator for do = 1 and do G (1, |) continue to 
hold under (5). 

4. Local Whittle estimation: asymptotic distribution. We introduce some 
further assumptions that are used in the results of this section. 

Assumption 1'. For some j3 G (0,2], 

fuiX) = /«(0)(1 + 0(A^)), A(0) G (0, oo) as A ^ + . 

Assumption 2'. In a neighborhood (0,(5) of the origin, C(e*'^) is differ- 
entiable and 

Ac(e*^) = 0(A-i) asA^O + . 

CLA 

Assumption 3'. Assumption 3 holds and also 
Eiel\Ft.i) = fi3, 

£'(ej|Ft_i) = /i4 a.s., i = 0,±l,..., 
for finite constants ^3 and /i4. 

Assumption 4'. Asn^oo, 

1 771^+^^ (log m)^ 

1 ^^ — - — > 0. 

m rfll^ 

Assumption 5'. Uniformly in A: = 0, 1, ... , 

Y: 7. = 0((log(A; + 1))-^), Y. ^3 = 0{{log{k + 1))-^), 

j>k j>k 

'yj = EutUt+j. 
Assumption 6'. For the same f3 G (0, 2] as in Assumption 1' and A, A' G 

\C{e'^) - C{e'^')\ <C\\- X'l^'^'^iM^ c G (0, 00). 
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Assumptions l'-4' are analogous to Assumptions Al'-A4' of Robinson 
(1995), except that our assumptions are in terms of ut rather than Xt. 
When do G (2)!)) we need an additional assumption, Assumption 5', that 
controls the behavior of the tail sum of Cj and 7^. This assumption seems 
to be fairly mild. For instance, consider the stationary Gegenbauer process 
proposed by Gray, Zhang and Woodward (1989): 

ut = {l-2aL + L^Y^et = C{L)eu 

t = 0,±l,±2,..., 

with \a\ < 1 and b G (0, ^). Its spectral density is /u(A) = {4(cos A — a)^}~^/27r, 
which has a fractional pole at Aq = cos^^ a. The asymptotic approximations 
for Cj and -jj are given by [Gray, Zhang and Woodward (1989), pages 236- 

238] 

Cj ~ Ai(a, b) cos{(j + 6)Ao - bTr/2}j^-\ 
(9) 

7j~A2(a,6)i2f' isin(7r6-jAo), 

as j — > 00, where Ai(a,5) and A2(a,6) do not depend on j. Since Cj and 
Pj satisfy Assumption 5' [Zygmund (1959), Theorem 2.2, page 3], Assump- 
tion 5' allows for a pole and discontinuity in /^(A) at A 7^ 0. However, As- 
sumption 5' is not satisfied if 7a: = (A: + l)~^(log(A; + 1))~^. When do = 1) 
Assumption 5' is not necessary, but instead we need Assumption 6'. It re- 
quires C(e ) to be Lip(min{/3, 1}) in the neighborhood of the origin. 

The following theorems establish the asymptotic distribution of the local 
Whittle estimator for do G i^A]- When do ^ {^,j), d is asymptotically nor- 
mally distributed, but d has a nonnormal limit distribution and slower rate 
of convergence when do £ [-g,l). This phenomenon occurs because, when do 
is large, the stochastic magnitude of X„ in the representation (8) becomes 
so large that it dominates the behavior of d. 

Theorem 4.1. Suppose Xt is generated by (1) with do G (Ai, A2) and 
Assumptions l'-5' hold. Then 

m^/\d-do)^^U 

m^^^d- do) ^lU + J{do)W^ 

m^-'ido{d-do)^J{do)W'^ 

where J{do) = {2TTf'^°-'^T{do)-'^{2do - 1)-3(1 - do) and U and W are mu- 
tually independent N{0, 1) random variables. 



for 


do 


e(i 


4^ 


for 


■do 


3 

~ 4' 




for 


do 


^(i 


>1), 
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When do = 1 the two main components of Wx{Xs), that is, Wu{Xs) and 
X„/\/2vrn, have the same stochastic magnitude, and the hmit distribution 
of the local Whittle estimator turns out to be mixed normal (denoted as 
MN). Intriguingly, the variance of d becomes smaller than the case where 
do < 1, as was found in the corresponding case for LP regression [Phillips 
(1999b)]. 

Theorem 4.2. Suppose Xt is generated by (1) with dg = 1 € (Ai, A2) 
and Assumptions l'-4' and 6' hold. Then 

m^/'^{d- do) ^ MN(0, a'^{W)) 



N{0,a'{h))(t>{h)dh, 
where W is iV(0, 1), (j){-) is standard normal p.d.f. and 

^(^) = 41 + 2/.2 + /,4- 

Remark 4.3. (a) When do = 1, the variance of the limit distribution 
of m}''^{d — do) is less than ^ since o"^(/i) < ^ almost surely. Numerical 
evaluation gives 

= 0.2028. 

Thus, the limit distribution of the local Whittle estimator has less dispersion 
when do = 1 than it does in the stationary and do S (211) cases. A similar 
phenomenon applies in the limit theory for LP regression, where again the 
limit distribution is mixed normal when do = 1 [Phillips (1999b)]. 

(b) Velasco (1999) shows asymptotic normality of the estimator for do G 
(|, |) using the model (5). We conjecture that the estimator has the same 
asymptotic distributions as those given above for do € (|,1] under (5), pos- 
sibly with different J(do), although the limit distribution for do = | might 
be difficult to derive. 

5. Fractional integration with a polynomial time trend. In many ap- 
plications, a nonstationary process is accompanied by a deterministic time 
trend. Accordingly, this section extends the analysis above to fractional pro- 
cesses with an a-order (a > 0) polynomial deterministic time trend. Specif- 
ically, the process Xt is generated by the model 

Xt = x^ + Xo + /it° = (1 - ly^ut + Xo + /it" 

(10) *^i (d)i 

= Y.^ut-k + Xo + fie, i = 0,l,2,..., /i/0, 

fc=o '^- 
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where Xq and ut are defined as above. As shown in Appendix A, the d.f.t. of a 
time trend takes the following form, uniformly for 1< s <m with m = o{n): 

1 " 



v27rn 



e 
t=i 

1 7 



= [1 + 0(1)]. 



1 - e*^= y/2iTn 

[See also Corbae, Ouliaris and Phillips (2002), who give exact formulae for 
d.f.t. 's of a time trend when a is a positive integer.] Therefore, neglecting the 
remainder term and U\^n{^)^ we obtain the following expression of Wx{^s)- 

1 - e*-^= V27rn 1 - e*^" 

(11) l-e*^» ^2™ 

+ Op(A7'^) + Op(A;in'^-i). 

When a > |, the second term in (11) is dominated by either the first term 
(if a > d — 2) or the third term (if 2 < a < c? — 2)) ^'^'^ then Wx{\s) behaves 
like C(n)A7^, where C{n) does not depend on s. As a result, d converges to 
unity in probability, and the local Whittle estimator is inconsistent except 
when the true value do = l. Since X^ = Op{n ''^), this result might be re- 
garded as an instance of a deterministic trend dominating a stochastic trend 
when a > d— 2- In the present case, because the d.f.t. of a deterministic trend 
is governed by the final observation, n", the outcome for unfiltered, unta- 
pered data is the inconsistency of d. In consequence, some caution is needed 
in applying the Whittle estimator to investigate the degree of long range 
dependence when a time series exhibits trending behavior involving a deter- 
ministic trend of uncertain order. The same result holds if the deterministic 
trend kt is fractionally integrated in the sense that (1 — L)°^kt = I{t > 1}, 
because then /c„ ~ r(Q + l)~^n°, as shown in Appendix A. 

Theorem 5.1. Suppose Xt is generated by (10) with do G [Ai, A2], a > 
2 , and Assumptions 1-4 hold. Then, for do G (-^jM] with 1 < M < 00, (i ^ 1 
as n —> 00 . 

6. Simulations and an empirical application. First we report simulations 
that were conducted to examine the finite sample performance of the local 
Whittle estimator using (1) with ut ~ i.i.d. A^(0, 1). All the results are based 
on 10,000 replications. 
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Table 1 
Simulation results for rf = 0.7 and d = 1.0 







d = 0.7 




d 


= 1.0 




n 


bias 


s.d. 


t.s.d.* 


bias 


s.d. 


t.s.d. 


200 

500 

1000 


0.0002 
0.0093 
0.0101 


0.1977 
0.1451 
0.1162 


0.1336 
0.1066 
0.0898 


-0.0235 
-0.0129 
-0.0102 


0.1779 
0.1280 
0.1019 


0.1204 
0.0960 
0.0809 



* t.s.d. denotes theoretical standard deviation. 



Table 1 shows the simulation results for d = 0.7 and d= 1.0. The sample 
size and m were chosen to be n = 200, 500, 1000 and m = [inP-^]. The estima- 
tor is seen to have smaller standard deviation when d= 1.0, corroborating 
the asymptotic theory. 

Figure 1 plots the empirical distribution of the estimator for d = 0.7, 
0.9, 1.0, 1.5 when n = 500 and m = [v?'^]. The estimator appears to have 
a symmetric distribution when (i < 1, and the positive bias and skewness 
of the limit distribution for d = 0.9 are not evident for this sample size. 
When (i > 1, distribution of the estimator is concentrated around unity, 
again corroborating the asymptotic result. 

As an empirical illustration, the local Whittle estimator was applied to 
the historical economic time series considered in Nelson and Plosser (1982) 
and extended by Schotman and van Dijk (1991). We also estimate d by 
first taking differences of the data, estimating d — 1 and adding unity to 
the estimate d — 1. This procedure is consistent for ^ < d < 2 and invariant 



to a linear trend. Table 2 shows the estimates based on both m - 



■ n 



0.5 



and 




Fig. 1. Densities of the local Whittle estimator: n = 500, m = n° 
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Table 2 
Estimates of d for US economic data 







m = 


„0.5 

: n 


m = 


„0.6 

n 




n 


disw 


C^LWD 


disw 


t^LWD 


Real GNP 


62 


0.990 


0.626 


0.946 


0.719 


Nominal GNP 


62 


0.983 


0.901 


0.930 


0.909 


Real per capita GNP 


62 


0.976 


0.631 


0.912 


0.728 


Industrial production 


111 


0.918 


0.516 


0.968 


0.593 


Employment 


81 


1.001 


0.660 


0.977 


0.713 


Unemployment rate 


81 


0.507 


0.527 


0.705 


0.741 


GNP deflator 


82 


1.143 


0.973 


1.049 


1.099 


CPI 


111 


1.020 


1.227 


0.828 


1.176 


Nominal wage 


71 


1.080 


1.026 


1.015 


0.983 


Real wage 


71 


1.105 


0.785 


1.030 


0.822 


Money stock 


82 


1.042 


0.913 


0.993 


1.232 


Velocity of money 


102 


1.055 


0.932 


0.970 


0.782 


Bond yield 


71 


0.676 


1.261 


0.740 


1.370 


Stock prices 


100 


0.914 


0.860 


0.984 


0.755 



m = n"'^ . These series produce long memory estimates over a wide interval 
that ranges from around 0.5 for the unemployment rate to 1.38 for the bond 
yield. For the unemployment rate, the local Whittle estimate from the raw 
data {(Iysn) and the local Whittle estimate from the differenced data ((^lwd) 
are very close together, both indicating only marginal nonstationarity in 
the data. For the bond yield, c^lwd is very different from ^lw- Especially 
for the GNP measures, industrial production and employment, the presence 
of a linear trend component in the data [which is supported by much of 
the empirical work with this data set following Nelson and Plosser (1982)] 
appears to bias c^lw heavily toward unity. These particular results indicate 
that, although the local Whittle estimator is consistent for 2 < ^ ^ l? the 
use of differenced data or even data tapering [Velasco (1999) and Hurvich 
and Chen (2000)] may be preferable, unless the time series clearly does not 
involve a deterministic trend and values of d > 1 are not suspected. 

7. Concluding remarks. The results of the present paper have a negative 
character, revealing that the local Whittle estimator is not a good general 
purpose estimator when the value of d may take on values in the nonsta- 
tionary zone beyond | . The asymptotic theory is discontinuous at d = | and 
again at d = 1 , it is awkward to use and the estimator is inconsistent beyond 
unity. 

This paper has not explicitly addressed the issue of what semiparametric 
estimation procedure is a good general purpose procedure for possibly non- 
stationary cases. Data differencing and data tapering have been explored 
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[Velasco (1999) and Hurvich and Chen (2000)], are easy to implement and 
have been shown to extend the range of appHcabihty of the local Whittle 
estimator. However, these approaches do have some disadvantages, such as 
the need to determine the appropriate order of differencing and the effects 
of tapering on variance. Another approach is to use the exact form of the 
local Whittle estimator suggested in Phillips (1999a), which does not rely 
on differencing or tapering. This estimator has recently been shown by the 
authors [Shimotsu and Phillips (2002)] to be consistent and to have the same 
A'^(0, j) limit distribution for all values of d. While it is still too early for 
a definitive answer to the question of what is a good general purpose semi- 
parametric estimator of d that allows for nonstationarity, these approaches 
offer some useful alternatives for applied researchers, and the present paper 
is at least a cautionary tale about performance characteristics of the local 
Whittle estimator in the nonstationary environment. 

APPENDIX A 

Technical lemmas. In this and the following sections, x* denotes the 
complex conjugate of x, and |x|+ denotes max{x,l}. 

Lemma A.l [Phillips (1999a), Theorems 2.2 and 2.7]. (a) If Xt follows 
(1), then 

w,{X){l-e'^) = Dn{e'^;e)wu{\) 

(12) inX _ „iX 

\/2TTn \/2'Kn 

where Dn{e'^; 0) = ELo ^^e*^'^, 9 = 1 - d and 

n-l 



Uxn{e) = Dnx{e-'^L- 9)Un = E ^Ape-^P^ 

p=0 



'^n—pi 



(13) 

k=p+l '^■ 
(b) If Xf follows (1) with d=l, then 



jx 



(14) w,{X){l - e'^) = Wu{X) - -^^{e'^'^Xn - Xq). 

\/2-Kn 

Lemma A. 2. For 9 > —1 and uniformly in s = 1,2, ... ,m with m = o(n), 

(15) Dnie'^^;9) = (1 - e'^^f + O(n-^s^i). 



NONSTATIONARY LOCAL WHITTLE ESTIMATION 
Proof. We have 
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D„(e*^^;e)=f;Like^fcA. 



kl 



y> i-0)k ^ik\. 



(16) 



n+1 



A;! 






■e-2 



n+l 



.n+1 



since (-^)fc/A;! = r(-0)-^A:-^-Hl + 0(A;-i)) [Erdelyi (1953), page 47]. Be- 
cause 9 > —1 and s 7^ 0, the first term in (16) converges and equals to 
(1 — e*'*'")^ [Erdelyi (1953), page 57]. For the second term in (16), by Theorem 
2.2 of Zygmund [(1959), page 3] we have 



E^- 



-e-i^ik\s 



n+l 



< (n + l)~^^^max 



N 



n+N 

E« 

n+l 



ikXs 



= 0(n-^s-i). 

The third term in (16) is necessarily 0{n~^s~^) because Z]5^i^~^~^ 
0(?i-^-i). D 



Lemma A. 3. (a) A"^(l-e 



iX\ 



-(■K/2)ei 



+ 0(A) as A^0+. 



(b) For 6 > —1 and uniformly in s = 1, 2, . . . , m with m = o{n), 



(17) 



A;'n„(e 



i\s 



^(Tv/2)ei 



+ o{K) + o{s-'^' 



Proof. For (a), since |l-e*^| = |2sin(A/2)| and arg(l-e*^) = (A-7r)/2 
for < A < vr, we can write (1 — e ) in polar form as |2sin(A/2)| exp[i0(A — 
7r)/2]. It follows that 



A"'(l - e'^^y = \~%\ + O(A^))lexp(-i07r/2) + 0(A)] 



-{■K/2)ei 



+ 0(A) 



giving the stated result. Statement (b) follows from (a) and Lemma A. 2. D 

Lemma A. 4. Uniformly m p = 0, 1, . . . , n — 1 and s = 1, 2, . . . , m with 
m = o{n) : 

(a) 

(18) 



^A. 



|o(|p|: 

\ 0{n^' 



odpC^-i 



0{n'^^^] 



+ h 



for e>o, 

for 9 e (-1,0), 
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(b) 

(19) 



= 0(|p|^-2ns-i), fore>-l. 



Proof. Observe 



p+l \p+l / 



The required results follow from 



J^^_,_i^ f0(|p|;^), for0>O, 

\0{n'^), for 0G (-1,0), 



P+i 



Y.k-'-' <Y.k-'-\ 
p+l p+l 



p+l 



-e-l ikX, 



< (p+l)~^^^max 



TV 
1„„-1^ 



P+N 

E^ 
p+l 



ikXs 



0{\p\l'-'7lS 



andEp+ifc''-' = 0(W;^-i). D 

Lemma A. 5. (a) Under the assumptions of Theorem. 3.1, we have the 
following: 



(al) 



E\UxAO)\ =o{hns{e)), 



(a2) E{Xn-Xo-C{l)X'J' = o{n^''-^), 



uniformly in s = 1,2, . . . ,m, where X^^ = X]fc=o j^^n-k "-it- 



1 id)k , 



^l-26»g2e-l _^2d-lgl-2d 



/in.(e) = <(nl-2V^-l(log(s + l))' 



/or0G(-i i), 



^^2d-i^i-2dQQg(^^;L))^^ /on 



2' 



(b) Under the assumptions of Theorem 4.1, we have, uniformly in s 



l,...,m, 



E\Ux^r^{0)-C{l)exAO)r 

= 0{n'-^'s^'~Hlognr^ + n'-^'s-^) for G (-i, i). 
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Proof, (a) We prove (al) first. When = the stated result follows 
because Ux,n{0) = 0. When 6^/0 define Op = h^pe-'P^' so that Ux,n{0) = 
Y^pZo dpUn-p- We suppress the dependence of ap on 6 and A^. Summation 
by parts gives 



n-2 



n-1 



(20) 



U\^n{G) = ^(ap-ap+i)^n„_j + o„_i^- 



<-n-y 



p=0 



j=0 



j=0 



Observe that 

ap — Op+i 



{-^)k i{k-p)\. 



k\ 



^^iik-p)Xs 



k\ 



k=p+2 

E ■ 
/=p+i 



^ E 

fe=p+i 

n 

^ E ■ 

k=p+l 
n-1 

^ E 

k=p+l 



y^ {-0)k ^i(k-p-i)X, 



k\ 



'-0)l+l A{l-p)\, 

(/ + !)!' 



(— ^)fc _ (— 6')fc+i 
A:! (fc + 1)! 



^i(k-p)Xs _|_ r 



-ipAs 



n! 



+ iz^g-ipA. 



n! 



where the fourth line follows from 

{-d)k {-e)k+i 



k\ 



{k + l)\ 

T{k-e) 



[i + e)T{k-e) 



T{-e-l)T{k + 2) T{-e)T{k + 2) 



Define 



n-1 



_ (i + g)r(fc-g) ■(,_,^,^ 

k=p+l ^ 



-e)T{k + 2) 

and then, since a„_i = ((— 0)„/n!)e~*("~^)''*'', 



n-2 p 



e)n 



n-2 



UxAO) = E ^np E ^n-, + ^ E ^"'"'^ E ^« 



p=0 i=0 

( 



+ 



!i^-i{n-l)X, 



n\ 



p=0 
n-1 

j=0 



3=0 
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n-1 p 



n—2 p 

p=0 j=0 

Uln + U2n- 



-e)n 



n\ 



p=0 j=0 



We proceed to show that the U.n are of the stated order. First, for Uin we 
have 



(21) 



■'np 



0(min{|p|;^"Mp|;''"2^s"H) 



uniformly in p = 0, . . . , n — 1 and s = 1, . . . , m. Equation (21) holds because 

n— 1 /?t— 1 \ 



1 + 



•'np 



^e-^^= ;^ fc-^-v^^= + o 5:^ 



-6»-3 



and 



k=p+l 
n-1 

p+i 

n-1 

p+i 



\p+i 



n-1 

p+1 



<(p+ l)"^~^max 

AT 



0{\p\l'~\s~'). 



p+N 

p+i 



Next, 



(22) 



EiJ2un-j 



(p+l)£(l-U|/(p + l))7,=0(N+), -/j^EutUt+j, 



for p = 0, . . . ,n— 1, and it follows from Minkowski's inequahty that 

/ / '«/« n \ 2\ 

=0 en;'"'^'+ Ep"'^'/^-"M 



vp=0 

1-26* ^20-1-, 



p=n/s 



e(-ii) 
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For U2n, we rewrite the sum as 

n-l p 



u. 



2n 



-9)r 



n\ 



p=0 j=0 



li< 



■n-j 



_n\ n n 

-^ E e^^"-^)'= E ^"-.- 



n! 



-e\ 



n\ 



n—p=l 
n k 

■ E ^fc E ' 
fc=i (?=i 



n—j=n—p 



pigAs 



n 






fc=l 






n! 1 — e 



iXs 



fc=l 



n! 1 — e' 



^|[/2„|2 = 0(^1-29^-2) follows from (22) and E\wuiXs)\'^ = 0{1) [Robinson 
(1995), page 1637], and the stated resuh follows because s"^ < s'^^~^. 

We move to the proof of (a2). Define ap = {d)p/p\ so that X„ = X]p=o o.p'Un-p + 
Xq. Similar to the above, summation by parts gives 

Xn — Xq — C{l)X^ 

n-~2 p n-1 

= E("p ~ "p+i) E(^'«-i ~ C'(l)en-i) + On-i E(^n-i ~ C'(l)e„_j). 

p=0 i=0 j=0 



c- _ _ r(d+p) 

smce Op Op+i — r(<i-i)r(p+2) 

result follows if 



d-l^ 



0{\p\'^') and ap = 0{\p\'!^'), the stated 



(23) 
Now 



E 



^{Un-j -C{l)en-j) 
■j=0 



o{p) asp^KX). 





E 


rp n2 
Lj=0 J 


E 


^(l)>>n-i 

L i=o J 




9 

=0 


P 
1=0 J 



Y.{p+l-\j\)7,, 
j=-p 

oo 

(p+l)C(l)2 = (p+l) J2 7j' 
C(l)i? 



p oo 



j=-oo 



7 . 7 . Cr£n-j-r 2_^ ^n-l 
.j=Or=0 1=0 
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p p oo 



j=0l=0r=0 



= C{l)Y,{p + l-r)cr, 

r=Q 

and it follows that -E[X]?=o(^"-i ~ C{l)en-j)]'^ is equal to 

p 

-ip + i) J2 ^j-^Y.nj 

(24) '^'^^-^ ' 

+ 2C(l)(p+l) J2 Cr-2Cil)J2rCr, 

r>p+l 1 

which is o{p) from X^^^oo Tj; X^cf ^r ^ ^^ ^^^ Kronecker's lemma. Therefore, 
(23) and the stated result follow. 

(b) Let M be a generic finite positive constant. We collect some facts that 
are used repeatedly: for a £ (— 1, C) and q>2, 

g V^ q 

(25) EOogO"' < (log 2)-^ 5: + (ilogg)^^^ < Mq{\ogqr\ 

1=2 2 ^ 

1=0 yg 

(26) 

<Mg'"+^(logg)-2. 

Proceeding similarly to the proof of (al), we obtain 

UxAG) - C{l)exA9) = Um + f/sn, 



where 



n-2 p 

Uln = X] Kp^{Un~j - C{l)en-j), 
p=0 j=0 



n\ 1 - e*^ 
and bnp is defined in (21). 



^' ' (27rn)i/2[t/;„(A,)-C(l)u;,(A,)] 
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,n- 1, 
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First, we show that, uniformly in p = 

p 
E " 



(27) 



n2 



^{Un-j -C{l)en- 
.j=0 



= 0(|p| + (log(p + 2))-"). 

When p = 0, (28) follows immediately. When p > 1, from (24) the left-hand 
side of (27) is equal to 

p 

-(P + 1) E 7j-2Ei7j- 
|i|>p+i 1 

+ 2C(l)(p + l) E Cr-2C{l)J2rCr. 

r>p+l 1 

The first and third terms are bounded uniformly in p by p{log{p + 2))~^ 
from Assumption 5'. For the second term we have 



p 
> >7i 


= 


p p 

>;>;7. 


1 




i=i k=j 



=oij2iiog{j + i)r'j 

= 0((p + l)(log(p + l))-^) 

uniformly in p, where the third equality follows from (25); X^i '"Cr = 0{{p + 
1) X (log{p + 1))~^) follows from the same argument, and (27) follows. 
From Minkowski's inequality, (21) and (27), (£^|f7i„p)-^/^ is bounded by 

n-2 

E|bnp|Hf(log(p + 2))-^ 

p=0 



:f~''/\s-Hiog{p + 2))-'' 



= O (n/s)i/2-^(log(n/s))-2 + {log{n/s))-\s''J2\P\+'~'^' 
\ n/s J 

= 0(nV2-e/-V2(iog„)-2)^ 

where the third line follows from (26), and the fourth line follows because 
(log(n/s))-2 < (log(n/m))-2 = 0((logn)-2); E\U2n\^ = 0{n^~^^s-'^) follows 
from (27) and £;|u;„(A,)-C(l)u'e(A,)|2 = 0(1) [Robinson (1995), page 1637], 
giving the stated result. □ 





/n/s 








n 





\p=0 


-'/\\o 


g(p + 


2))- 


p=n/s 
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Lemma A. 6. Under the assumptions of Theorem 4.2, we have, for j = 
l,...,m, 

' ^ ■'^ ^ ^ ^ ■'^' \0{n~^logn), /or /3g [1,2]. 

Proof. The proof essentially follows from Theorem 3.15 of Zygmund 
[(1959), page 91]. An elementary calculation gives 



Ewu{Xj)wl{Xj)-C{e'^^)/27r 

[C{e'^)-Cie'^^)]K{X-Xj)dX, 



(28) ^ 

~ 2^ 

where K{X) = (27rn)-i J2i Ei e*(*-^)^ is Fejer's kernel. From Zygmund [(1959), 
page 90], |i^(A)| < An"U"2 and \K{X)\ < An for a finite constant A. As- 
sumption 6' implies \C{e'^) - C{e'^^)\ < C\X - Xjl"^^^^^^^ for [A - Aj| < d/2 
and large enough n. Therefore, if we split the integral (28), each part is 
bounded as follows: 

+ / =0[n^^ X~^dX 

-TT JXj+5/2 \ Js/2 

= 0{n-'), 

Aj-l/n /-A, +5/2 / .5/2 

Aj-5/2 JAj+l/n V Jl/n 

0(n-'3), for /3e (0,1), 

0(n-i log n), for /3e (1,2] 



and 

= 0(n A"^""^^'^^dA 

Aj-l/n V ./O 

= 0(n-™'^^^'i>). 

Hence, Ewu{Xj)wl{Xj) — C(e*'^j)/27r has the stated order; EIu{Xj) — fui^j) 
has the same order by a similar argument, and the order of 

E\wu{X,)-C{e'^^)we{Xj)\^ 

= E[Iu{Xj) -2Re[wu{Xj)C*{e'^^)we{Xj)] + 27r/4A,)/e(A,)] 

follows, n 
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Lemma A. 7. Let vt = I{t > 1} and A^^ut = (1 - L)~"ft with a>0. 
Then uniformly in 1 < s <m with m = o{n) the following hold: 

(a) w^~c.^{Xs) 



1 



n 



(b) wt<^{Xs) 



1 - e*^« r(a + 1) ^2^ 
1 n"' 



1 - e*^» y/2 



■KU 






Proof. For part (a), first consider the case a S (0,1]. From Lemma 
A.l(b) 

(29) 

- (1 - e'^^)-^e'^'^-'^Vn/V2^. 



For a = 1, since i(;t,(As) = it follows that 

^A-i.(A«) = -(1 - e'^')-^e'^^A-\n/V2^ 

From 



-(l-e*^0"^e^^'^/^2^- 



(d)fc {d)k-i _ T{k-l + d) _ {d-l)k 
k\ {k-l)\ T{d-l)T{k + l) k\ 

and the fact that (a — l)o/0! = (q)o/0! = 1 we obtain 

t-i 



'{a)k (a)fc-i 



fc=o 

E 
fc=i 



/k! (fc-l)! 



+ 



(a)o (a)t- 



0! (t-1)! 



Hence, for a G (0, 1) from Lemma A. 2 we have 



1 Y^ {a)t^i itx, 
^(t-lV 



^2^*"^ (t-1)! 



jAs 



V2 



■KU 
iXs 



Dn{e'^^;-a) 



(a)n 



n! 



\/2irn 
Then the stated result follows because 



[(l_e*^»)-« + 0(n"s-')]. 



A--^;„ 



(a + 1) 



n-l 



1 



(n-l)! r(a + l 



-n"[l + 0(n-i)], 
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so that the second term on the right-hand side of (29) dominates the first 
term. The result for a > 1 is derived from (29) and by induction. 
For part (b), observe that 



W^-c.^{Xs) 



e^^' g (« + l), ^,,,^ 



V27rn 



t=o 



+ 



t\ 



i\s ^-^ 



V27rn \/2 



TTTl 



E 



1 



r(a + i; 



-e + o{t 



a~l\ 



= e'^^T{a + ly'wt^Xs) + 0(n"-i/2), 
and the required result follows from part (a). D 

APPENDIX B 
Proofs of theorems. 



„it^3 



Proof of Theorem 3.1. For notational simplicity we assume Xq = 
throughout the proof, but the result carries over for general Xq with Xn — Xq 
replacing Xn- We follow the approach developed by Robinson (1995) for the 
stationary case. Define G{d) = Gom"! Y.T \f-^'^° and S{d) = R{d) - R{do). 
For arbitrarily small A > 0, define @i = {d:do — ^ + ^^ d< A2} and 02 = 
{d: Ai < d < do — ^ + A}, possibly empty. Without loss of generality we 

assume A < ^ hereafter. In view of the arguments in Robinson (1995), d ^ do 
if 



sup|r(d)|A0 and Pr infS'(d)<0 

01 \02 , 







as n — > 00, where 



T{d) = lo| 



Go 
G{d) 



log 



Gid) 



log —J2^ 



2d-2do 



m 



2d~2do 



m 



2{d-do) + l 



(2d - 2do 



— ^^ogj- (logm- 1) 



m 



Robinson (1995) shows that the fourth term on the right-hand side is 0{logm/m) 
uniformly in d E ©1 and 



(30) 



sup 

61 



2(d_do) + l ™p-^2d-2.o 



m 



m 



1 



O 



1 



m 



2A 
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(31) 
Let 



23 



sup 

ei 



1 Gido) ^ G{d) 
log -^ log 



Go 



G{d) 



0. 



m 



m 



B{d) 



2(d-do) + l ^^j-x2d-2do 



m 






from which it follows that 

G{d) - G{d) A{d) 



G{d) 



B{dy 



Gido) G{d) (B{d)\ fBido) + A{do) 

log -7; log 7773V = log 777TT + log 



G(] 



G{d) 



B{do) 



B{d) + A{d) 



By the fact that Pr(| logFj > e) < 2Pr(|y — 1| > e/2) for any nonnegative 
random variable Y and e < 1, (31) holds if 



(32) 



sup 
Oi 



B{d)-B{do) 



sup 
ei 



B{do) 
B{do)-B{d) + A{do)-A{d) 



B{d) + A{d) 



and 



0. 



For do £ (^) 1)) from the arguments in Robinson [(1995), page 1636], supQ^ l^(<^)| 
is bounded by 



(33) 



m—l 

E 

r=l 



m 



2A 



+ 



m 



^[Af°I.(A,)-Go] 



^[Af°/.(A,)-Go] 



Define Dnj{d) = (1 - e*^0"^^i-D„(e*^J;6'). Then from Lemmas A.2 and A.3 
we have 



(34) 



|D„,(d)|2 = l + 0(A|) + 0(/-2) 
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uniformly in j = l,...,m. Hereafter let I^j denote Ix{Xj), let Wuj denote 
Wu{\j), and similarly for other d.f.t.'s and periodograms. Now 

Af »4,- - Go 

(35) = Af °4, - \Dnj{do)\^Iuj + [\Dn,{do)\'' - fu{0)/fu{Xj)]Iu 



'nj 



+ [lu, - |C(e*^Ol'4,]/.(0)/A(A,) +/n(0)(27r/,, - 1). 

From Lemma A.l(a), the fact that \\A\^ - |B|2| <\A + B\\A- B\ and the 
Cauchy-Schwarz inequality we have 



(36) 



< E 



I. do 



2Dnjido)w^ 



A"° Uy 



•J Jx,n['^0)+e'^^Xr, 



uj 



1 - e'^^ 



V2 



X E 



Xf U. 



x,n{eo) + e'^^ X„ 



1 - e*^J 



V2 



nn 



vrn 



1/2 



1/2 



with ^0 = 1 ~ do. From (34), Lemma A. 5(a) and EIuj = 0(1) [Robinson 
(1995), page 1637] the right-hand side is 0{j'^°~^), giving 



I , \m/ r^ 



J2[\f°Ic.j-\Dnj{do)\^Iuj] 



= 0(m°'o~^+m-2^1ogm). 
For any r/ > 0, (34) and Assumption 1 imply that n can be chosen so that 

\\Dnjido)\^ - fuiO)/UXj)\ <V + OiX]) + 0(j-i/2), i = 1, . . . , m, 
and from Robinson [(1995), page 1637], we have 

E\Iuj - |C(e*^Ol'^..l = OiJ-'/\logiJ + l))i/2), j = l,...,m. 
It follows that 

m / X 2A 1 r 
E(-) ^Y.\i\Dn,{dor-fu{0)/fu{X,)]In 



'MJ 



+ [4,-|C7(e^^OI%-]/«(0)//«(A,)l 

= Op{r] + 'm n~ -\-m~ logm). 

Robinson (1995) shows Y.Tir/m)'^^r~^\ El(2vr/ej - 1)| ^ 0. Using the same 
technique, we can show that the second term in (33) is Op(l), giving sup0^ 1^(^)1 
0. 
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For do = 1 first observe that 



A{d) 



Xl 2{d-do) + l^f j^^"-^"^^ 
27rn m 



■E - 



1 



2{d-dQ) + l 



m 



^Vmy 



\ 2d-2do 



v2 
\2do J f~i ^n 



From Lemma A. 1(b) we have 






(37) 



\2c(o T 

^^ "^ 27rn 



^ ;Iuj + 0(A2^ 



|l_giA7J2"«J ' "'^'"J^27rn y/2^ \l-e'^^\^ 



The results in Robinson [(1995), page 1637] imply that 

(38) E\wuj-C{e'^^)wej\^ = 0{j-Hog{j + l)), j = l,...,m. 

Using a similar decomposition as (35) and the results thereafter, with (37) 
and (38), we obtain 



E 



(39) 



^(Af °4,- - 27r/„(0)/e, - Xl/{27rn)) 
Lj=l 

= 0{rr] + r n^^ + r ' logr). 



for l<r <m. In view of (30) it follows that 



sup \A{d) - X^/{2-Kn)\ = Op{r] + m^n-^ + m^^^ logm) + Op(l). 
Oi 



and (32) fol- 



Finally, observe that (30) gives supg^ \B{d) — Gq\ = 0{m^ 
lows. 

Now we consider 02 = {d: Ai < d < d^ — -^ + A}. In a way similar to 
Robinson [(1995), pages 1638 and 1639] we have 

Pr^inf 5(d) < o) < Pr f^Ba, - l)Af «/., < o") , 
where p = exp(m~^ X)™ log j) ~ m/e as m ^ oo and 



/ (j7p) 



2A-1 



for 1 < j <p, 



^ \U/p) '^'^° ^: ioT p < i <m, 

m 

^Oj =0{m), 
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yjflj = 0{m 



2-4A> 



J2ajf°^^ =0{m^~^^logm + m' 



do^ 



II«ii 



-1/2 



0{m 



1-2A> 



Applying (35) and (37) and proceeding as above in conjunction with the fact 
above and m~^ J2T{^j ~ l)(27r/£j — 1) ^ [Robinson (1995), page 1639], we 
obtain 

- E(«. - 1)^^^^. = [Go + ^i{do = 1}) - E(% - 1) + «p(i)- 

Choose A < l/(2e) < 1/4 with no loss of generality. Then for sufficiently 
large m we have m'^ J2Ti^j — 1) > <^ > and hence 



Pr 



(^E(«.-i)\''"^^.<o)-o 



as n — > oo. Therefore, d — > do, giving the stated result. 

For the limit of G(d), recall G{d) = G{d)+A{d)G{d)/B{d), d^ do, G{d) ^ 
Go and B{d) -^ Go- The required result follows because 



sup 
Oi 



AM- ^7(4 = 1} 



and X^/{2TTn) = Go{X^)^ + Op{l) -^ GoXi from a standard martingale CLT. 

D 

Proof of Theorem 3.2. Define G{d) = Gom'^ J2T >'f~'^ and S{d) = 
R{d) - R{1). For < A < i define Gi = {d: ^ + A < d < A2} and 62 = 
{d: Ai < d < 2 + A}, possibly empty. Then, by the same line of arguments 
as above, d ^ 1 if 



sup|r(d)|A0 and Pr inf S'((i) < ^ 
01 V02 



as n — > 00, where 



T{d) = log 



G(l) 



Go 
+ (2d-2; 



log 



ni 



G{d) 
G{d) 



log -E 



•2d- 2 



77T, 



2d-2 



2(d-l) + l 



^logj- (logm- 1) 



supejT(d)|AOif 

(40) 

Let 

A{d) 

B{d) 
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sup 
ei 



G(l) G{d) 

log — ; log 



Go " G{d) 



2d-l^/i\2d-2 ^_ 



TTi ^ \m 



m 



■E 
1 



2do \2do J 



2d-l^ ^/jx2d-2 



m 



Then a little algebra shows [G{d)/G{d)] = {2TT/n)'^~'^'^o\^A{d)/B{d)], giving 



log 



^(1) ,„ G(d)_,_/i?(d) 



log 



Go ° G(d) 

Therefore, (40) holds if 



log 



log 



fA{d)\ 



(41) 



sup 

ei 



A{d)-A{l) 



A{1) 



and sup 
ei 



Go J "°U(i)y 

B{d) - Go 



Go 



0. 



We proceed to approximate A{d) by 



Aiid) 



2d-l^/j^2d-2 



m '-r \m 



2-2dox2do|i JA,|-2 ^n 

27rri 



E£ /-^-"AfH-e 



= (27r)2'^o^3^(l)2?ii-2do(x^)2 + Op(l) uniformly in d G Oi, 

where the second equality follows from 

^•2-2do^2do|i _ g*A, |-2 ^ ^2vr)2'^0-2^2-2do(i + 0(A|)), 

(30) and Lemma A.5(a2). For do G (1)|]) from Lemmas A. 1(a) and A. 5, 
similarly as in (36) we obtain [Z)„j(d) is defined in the proof of Theorem 
3.1] 



E 



■2-2do \2doT _ ■2-2do \2do|i _ JA,|-2 ^n 



< E 



■l-do 



giAj 



X \E 



j'^''''Dnj{do)Wuj-j 



ji~doxfuxU0o) + '^e'^^Xn 



V2 



Tin 



1/2 



^■l-do^do^^^^(g^)2\l/2 



1 - e*^J ^2^ 



vrn 
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It follows that supQ^ l^('^) ~ ^i('^)l = Op{m^ '^'■' + m ^'^), and uniformly in 
01 we have 

A{d)-A{l)_ Op(l) 



^(1) 



(2vr)2do-3(^(l)2„l-2do(X^)2 +Op(i) 
Op(l) 



where yt = n ' ''((io)n-tet/(^ — t)- Assumption 3 implies 

n 

Y,E{y!\Ft.i) -^<l>i= Tidor\2do - l)-\ 
1 

n 

J2Eiyp{\yt\>S})^0 for all <5 > 0. 
1 

Therefore, from a standard martingale CLT we have n"^'^~ "X^ — > A^(0, ^i). 
Thus, 



sup 

01 



A{d)-A{l) 



Ail) 



0, 



and supe^ li-^lc^) ~ Go]/Go\ ^ as before, thereby establishing (41). 

Next, consider 62 = {d: Ai < d < i + A}. Let p = exp{m~^J2T^ogj). 
Then S{d) = log{D{d)/D{l)}, where D{d) =m-^J2T(.J/P?'^~^fl'xj- It fol- 



lows that 



where 



inf D{d)>—y] ajfl^j , 



/ iJ/p) 



2A-1 



for 1 < j <p, 



Then 



■^ I iJ/p) ^; foi p < j <m. 

(42) Pr (^inf S {d) < OJ < Fi (^f^ (a,- - 1)^^^'''^ Af " 4,- < j . 

In view of the fact that Efoj = 0{m), J2T ajf~'^° = ©(m^-^^logm + 
'm~'^°), J2T ^jj~^ ~ 0{m^~'^ ), we obtain similarly to before 

1 m 



m 



1 *" y2 

l^(a, - 1)/-2^oa2'^o|i _ ,a,|-2_^ ^ (1) 
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(27r 



|2do-3^1-2dox2. 



m 



^(a,-l) + Op(l). 



1 



Since m ^X^i^li^^j — 1) > <^ > for sufficiently large m by choosing A < 

n ^ oo and 



l/(2e), we obtain Vr{m-^Y.T{aj - l)f'^'^nf°I:,j < 0) ^ as 

hence d 
we have 



hence d— > 1, giving the stated result. For do € (|,|], from Lemma A. 1(b) 



(1 



e'^^ }w 



XJ 



WA2 



d~l 1/2- 
n n ' 



^Xne'^^/V2^. 



Because E\waxj\'^ = ©(n^'^-^g-i) from AXt ~ I{d - 1) and n^/^-rfj^^ ^^^_ 
verges to a Gaussian random variable, the stochastic behavior of Wxj is 
dominated by X„. Hence, the required result follows from the same line of 
argument as above, and the results for larger do are derived similarly. D 

Proof of Theorem 4.1. We follow the same line of approach as the 
proof of Theorem 2 of Robinson (1995). Theorem 3.1 holds under the current 
conditions and implies that with probability approaching 1, as n — > oo, d 
satisfies 

(43) = R'{d) = R'{do) + R"{d*){d- do), 

where \d* — do\ < \d — do\- Now 

4[F2{d)Fo{d) - F^{d)] _ A[E2{d)Eo{d) - Ef{d)] 



R"{d) 

F^{d) 

-. m 

Ffc(d) = -^(logjfA|%-, 



m 



El{d) 

1 m 



\k-2di 



m 



J) J 



•^xj- 



As pointed out by Andrews and Sun [(2001), page 21], since Fk{do) = 
Op((log77i)'^) as shown below, we need to show Ek{d*) — Ek{do) = Op{n'^'^° {log m)~ 
rather than Op{n?'^) as in (4.4) in Robinson. Fix e > and choose n so that 
2s < (logm)2. Let M = {d: (logm)'^|d - do\ < e}. As in Robinson [(1995), 
page 1642] we have 



(44) 



FT{\Ekid*) - Ek{do)\ > (27r/n)-2'^«(logm 



< Pr(G((io) > (logm)^^^''/(2ee)) + Pr((logm)''|d* - do\ > e). 



The first probability tends to because G{do) -^ Gq. The second probability 
tends to if 



(45) 



sup 

01 



B{d)-B{do) 



B[do) 
Op((logm)' 



+ sup 
Oi 



B{do) - B{d) + A{do) - A{d) 



B{d) + A{d) 



-12\ 
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Using (35) and Assumption 1', we obtain, for 1 <r <m, 

i?|^(Af °4, - /„(0)27r/,,)| = Oir'^'^ + r^+^n-") 



and Robinson [(1995), (4.9)] shows that Ei(27r/ej - 1) =Op(rV2). j^^ ^^^_ 
junction with supg^ \B{d) — Go\ = 0{m~'^^) they give (45). It follows that 

B"{<f) = 4[F2{do)Fo{do) - F^{do)][F^{do)r' + Op(l) = 4 + 0^(1), 

where the second equahty follows from Fk{do) = Gqiti^^ Y^^^^^^iY + Op{{[ogm)~^) 
obtained similarly as A{dQ) — > 0. Next we consider the first term on the right- 
hand side of (43). Now 

m 

i/2ii'(do) = 2m- V2 ^ ^^. ^xf^j^. _ G^] [Go + Op{l)r\ 



m 



where Vj = log j — m^ J2T logJ ^^'^ J2T ^j ~ 0- Fi"om Lemmas A.l and A. 5, 
(34) and (38), we have 

mm m 

^ VjXf'I^, = J2 ^jlnj + (2vrn)-iz2 ^ ujXfo 1 1 _ g^^. | "2 + 2 Re[r„] + i?„, 
11 1 

where 



Tn = J2,^,D:^ido)C*ie'^n<j>^fa - e*"0~'C'(l)eA,„(^o)(2™)-^/' 
1 

and 

R^ = Op{{logm){m'^°~'^^'^ logm + ?n^/^(logn)-^ + (logm)^ + m^n~^)) 
= Op(mV2). 

Rewrite r„/C(l) as 

m 

Y,v,Dl^{d,)C*{e'^^)\f{l-e'^^)-\2^n)-' 
(46) ^ 

p=0 g=0 

Since £t is a martingale difference sequence, E\Tn\'^ is bounded by 

1 mm n— 1 

(47) -iEEi-.ii-'^i\'"~'Af-iEi^.^.,iiL.,,i 

j=lfc=l p=0 
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n-1 



n-1 



(48) 



(49) 



-1 lit / fc — _L / / 1 IL— ± 



-Afcgl 






p=0 



fc=l 



g=0 



+;^EEi-.ii-^iAf-^Ar^ 



i=ifc=i 



n-l _ _ 

X E I^A,p||6'-Afcp| 



and (47) and (48) are bounded by, respectively, 



n-l 
y^gig(Aj-Afc) 

q=0 



n-l 



(logm)2^^/"-ifc 



rfo-l/,do-l„-2do 



n 



^ IpI^'^o-^ ^0(n-im2'^"(logm)2), 



1 1 



logJTl^j 



rfo~lj2~'^0 



'■"/i 



n-l 



ENr' + E^'°^'^j' 



do-2 — 1 



0((logm) 



1 V „/j 

In view of the fact that Ya)"^ e'''^^^'^''^ = nl{j = k}, (49) is bounded by 



n-l 



(logm)27i-i^Af'"2^|p|f^"2^0(m2'^"-i(log7n)2), 

1 

giving Tn = Op{n~^''^m ° logrri + (log rri)^ + rn o~^''^ logm) = Op{m^''^). 

From Lemma A.5 and the fact that Y.T'^jf'^""^ = (24 - l)~^(2do 
2^j^2do-i _|_ 0(log7n), we obtain 

m 

iA, 1-2 _ ■::f^^^2do-l\„l~2do(Ye\2 



(2^n)-ix2^^.,Af°|l - e^^^|-2 = HGom 



n 



■(XD^ + Op(l)], 



whereH = (27r)2'^o-2(2do-l)"2(2(io-2). Robinson [(1995), page 1644] shows 
that J2T ^jluj = Go ET Vjhj + Op{m^l'^). Therefore, 

m 

1 

The first term on the right-hand side converges to a A^(0, 4) random variable 
by Robinson (1995). For do G (2' f)? ^^^ second term on the right-hand side 
is Op(l), and the required result follows. For do G (|, 1), we have 



m 



2~2do p/ 



R'ido) = 2En^~'''^{X^J^ + Op{l) = 2H^yJ +Op(l), 



where yt = n^'"^ "'''((io)n-t£t/('^ — t)l, suppressing reference to n in yt. Since 
E?i?(y?|i^t-i) ^ «5i = r(do)-2(2do - 1)-' and EiEyf = 0{n-'), from a 
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d 



standard martingale CLT we obtain Yyi Ut -^ -^(0, ^i), giving m"^ ^'^°R'{dQ) — > 
2H$ix? and the required result. When do = f , m^/^R'{do) = 2J2'l zt + 2E{Y,iyt)'^ + 
Op(l), where yt is defined above, zi = and, for t>2, 



Zt = et^esCt-s, Cs = 2n ^m ^/^^i/j cos(sAj 



s=l 



and S,t = {zt, yt)' form a zero-mean martingale difference array; hence X]i Ct -^ 
N{0, diag(l,$i)) if, for any nonrandom (2 x 1) vector a, 

n 

(50) J2 E[{a^tf\Ft.i] - a' diag(l, $i)a ^ 0, 

1 

n 

(51) ^^[(a'et)^/(|a'6|>'^)]^0 forall5>0. 

1 

Robinson shows J2i ^i^il^t-i) — 1 — > and Yli^^t ~^ 0- I'^ conjunction 
with Y^Eyt -^ 0, (51) is satisfied. Since Ei ^(y|l^t-i) ^ ^i, (50) holds if 

1 t=2 ^'^'' ^>- s=l 

The term in the middle has mean zero and variance bounded by 



n n min{t-l,M-l} 

i=2 M=2 s=l 



2 1 

n t— 1 M— 1 

+ 2ni-2'io ^ 1^ _ i|^.-i ^(„ _ ^)do-i ^ ci_,c,_,. 

3 2 1 

Robinson [(1995), page 1646] shows Cg = Cn-.s, \cs\ = 0{'m~^''^s~^logm) for 
l<s<n/2, \cs\ = 0{m^/^n^^logm), and Yli c'i = 0{n~^ {log m)^). There- 
fore, the first term on the right-hand side is bounded by n~^(logm)2, and 
the second term on the right-hand side is bounded by 



^ n \ 1/2 n i-1 / t-1 \ 1/2 

Ec? n^-^'^o^|n-t|^^5:(---)'"-M E -n 

^ 1 / 3 2 Vt-u+l / 



: O (logTTi) m ' n 



2rr,-V2„l/2-2do 
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X 



^ 1^ _ t|^.-i ^(n - n)"'«-i((t - u)-i/2 + \n- t\l^/^) 



3 2 / 

= 0(m~^/2(logm)2), 
giving (50), thereby completing the proof of the theorem. D 

Proof of Theorem 4.2. We follow the approach and notation of the 
proof of Theorem 4.1. First, from Assumption 1' (39) is strengthened to 



(52) 



E 



Y.iXf'^hj - 27r/„(0)/,, - X2/27rn) 



= 0(r^+^n^'^ + r^/2logr), 
for 1 < r < m. It follows that 

sviv\ A{d) - Xl/2TTn\ = Opim'^n-'^ + m-'^^\ogm), 

and thus (45) holds. Since G((io) = Gq + X^/(27rn) + Op(l), the first proba- 
bility in (44) tends to and i2"(d*) = 4(^2(^0)^0(^0) - Ff (do)][-^o((^o)]"^ + 
Op(l). Using (52) and the fact that Ylii'^'^^sj ~ 1) = Op{r^''^)^ we obtain 



ftW-(G„ + |i 



m 



Ea°gi)' 



Op^m'^n ^(logm)^+m ^/^(log?n)^), 



givingi?"((i*)A4. Now 



m 



'^'R'ido 



2m-i/2Er^jAf°4, 
G(c?o) 
2m-V2Er^.Af°4, 



Go{l + n-^X^r) + Op{l)- 



The numerator is equal to 

m 

2?n-i/2^i/j(l + 0(A2))|u;„j- - e*^^X„/^/2^|- 
1 

m 

= 2m"^/^ E z^j/„j - 2m"^/^(Xn/\/2^)2 
1 

+ Op{m^'^n^'^ logm) 

m 

= 2m^ 1/2(^0 XI '^i [2^^£i ~ 1] 



Xz^,e '^^m 



iA. 



L 1 
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m 

- 2m^i/2(^on^^/^^^^i^j2Re[V2^W£j] +Op(l), 

1 

where the third hne follows from Robinson [(1995), (4.8)], Lemma A. 6 and 
Assumptions 1' and 6'. It follows that 

"^ ""^'^^^ i + (E?..)^ + o,(i) ' 

where yt = n~^''^et, Xt = n^'^StCt and zt and q are defined in the proof of 
Theorem 4.1. Therefore, Wn = J2i{zt,yt,xty-^Wr^N{0, diag(l,l,2)) if 

n 

(53) 5]£;[(zt,2/i,rEi)'(zt,yj,xt)|Fi_i]^diag(l,l,2), 

1 

(54) J2Ezf + j:Eyf + J2Ext^0. 

Ill 

We have already shown J2i ^^t + X^i ^yf — > in the proof of Theorem 4.1. 
Since 



Y^Ext = n'j:^: 



n 
,4 
t 

1 



n/m oo N 

4 



= 0{n ^2^'m^{logm) +n /_] wi "s (logm^ 

\ 1 n/m ' 

= 0{n~ 7n[logm) ), 

(54) holds. To show (53), E?^[(^t,yi)'(^i,yt)l^t-i] ^ diag(l, 1) has already 
been shown above, and 

n 

1 

n/m \ 2 

= 4n~ m" ^ ^ Uj cos(sAj) 

m n 

= 4n~-^m^^^t'|^cos^(sAj) 



+ 2n ^m ^^^i/jZ^fc^[cos{s(Aj + Afc)} + cos{s(Aj - Afc)}] 
i^fc 1 



m n 



: 2n-^m-^ ^ ^mi^ + cos(2sAj)] 
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since Y.T ^"j ~ "^- Furthermore, X)? E{xtyt\Ft-i) = Y.i Q = and 

n n i— 1 

1 t=2 s=l 

because the right-hand side has mean zero and variance 

n n min{t-l,«-l} 

n^Ct^Cu ^ Ct-sCu-s 

t=2 u=2 s=l 

n t—1 n t—1 u—1 

2 1 3 2 1 

The first term is 0(n~^ (log 77i)^), and the second term is bounded by 

/ T) \ ■^ / TJ \ 



" Eic^i E^i 



\ 1 / \ 1 / 

= 0[(log?n) I 771 ' log m + m '' log m^Js 

V V n/m J J 

Thus (53) holds and Wn — > W. Therefore, from the continuous mapping 
theorem 

conditional on W2, and unconditionally 

m^/\d -do)^ N{0, 1(1 + 2/i2)(l + /i2)-2)0(/j) dh, 
J —00 

where (/>(•) is the standard normal p.d.f., giving the stated result. D 

Proof of Theorem 5.1. The argument follows the approach of the 
proof of Theorem 3.2. First we consider the case a < do — ^. Since a > -^j 
df) > 1 must hold. Let 



m ^ \m ) 1 1 — e^^i I ^ 27rn 



1 



Mi) = '-^Y.[i] 'iT-!iw^\i'-'' + '^''^l 



+ Op{\) (uniformly for 0i) 

4[Cln°-'^«+^/2^C2iV(0,l)]^ 
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for generic nonzero constants Ci and C2, where the second equahty and 
convergence in distribution follow by the same argument as before. Define 
the other quantities as in the proof of Theorem 3.2. Because 

supe^ \A{d) -Ai{d)\^0 follows, giving (41); Pr(infe2 S{d) < 0) ^ is ob- 
tained similarly and we establish (i — > 1 . 
Next consider the case a > do — 2 ■ I-^^fine 

M^} = ^i:(i^) "'-'-Aji,, 

m ^ \mj 

A (d) = ^^^Y^fLV''" 2 \' 1 

-^ (2'/r)~ /_f uniformly for 0i, 

and define the other quantities as in the proof of Theorem 3.2. Then it 
follows that 

G{d)/G{d)=n'^''-^A{d)/B{d), 

and supe, \T{d)\ ^ follows if supej[^(d) - ^(1)]/^(1)| ^ 0. Since 

E\n'-^''Xp,, - X]\l - e^^^r2(27r)- V'l = 0{r'^^+jn-' + n^o-^^-), 

supe^ \A{d) - ^i(d)| ^ follows, giving (41); Pr(inf02 S{d) < 0) ^ is ob- 
tained similarly, and we establish d — > 1 . D 
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